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Abstract 

Locally finite slanguages, denned via second order quantifications followed by a 
first order locally finite sentence, were introduced by Ressayre in [Res88]. They 
enjoy very nice properties and extend cj-languages accepted by finite automata or 
denned by monadic second order sentences. We study here closure properties of the 
family LOC^ of locally finite omega languages. In particular we show that the class 
LOC u is neither closed under intersection nor under complementation, giving an 
answer to a question of Ressayre [Res89] . 
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1 Introduction 



In the sixties J.R. Biichi was the first to study olanguages recognized by 
finite automata in order to prove the decidability of the monadic second order 
theory of one successor over the integers |Biic62| . In the course of his proof he 
showed that an tu-language, i.e. a set of infinite words over a finite alphabet, is 
accepted by a finite automaton with the now called Biichi acceptance condition 
if and only if it is defined by an (existential) monadic second order sentence. 
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Algorithms have been found to give such an automaton from the monadic 
second order sentence and conversely. Thus the above cited decision problem 
is reduced to the decidability of the emptiness problem for Biichi automata 
which is easily shown to be decidable. The equivalence between definability 
by monadic second order sentences and acceptance by finite automata holds 
also for languages of finite words [Buc60] , and has been extended to languages 
of words of length a, where a is a countable ordinal > u |BS73j . 

The research area, now called "descriptive complexity", found its origin in 
the above cited work of Biichi as well as in the fundamental result of Fagin 
who proved that the class NP is characterized by existential second order 
formulas, |Fag74 . Since then, a lot of work has been achieved about the logical 



definability of classes of formal languages of finite or infinite words, or of 
relational structures like graphs, sec |Fag93 |Pin96j [Tho96] |Imm99j for a 
survey about this field of research. 

Several extensions of existential monadic second order logic over words have 
been studied. 

Lauteman, Schwentick and Therien proved that context free languages are 
characterized by existential second order formulas in which the second order 
quantifiers bear only on matchings, i.e. pairing relations without crossover, 
[L5T94] . 

Parigot and Pelz, and more recently Yamasaki, extended monadic second order 
logic with two second order relation symbols and characterized classes of Petri 
net (u)-languages [PP85] [Pel87] |Yam99j . 

Eiter, Gottlob and Gurevich studied the relationship between monadic second 
order logic and syntactic fragments of existential second order logic over (fi- 
nite) words [EGGOOJ. Distinguishing prefix classes, they determined which of 
them define only regular languages and which of them have the same expres- 
sive power as monadic second order logic. 

Another extension has been introduced by Ressayre, in order to apply some 
powerful tools of model theory to the study of formal languages, |Res88] . He 
defined locally finite sentences (firstly called local). A locally finite sentence 
ip is a first order sentence which is equivalent to a universal one and whose 
models satisfy simple structural properties: closure under functions takes a 
finite number n v of steps. 

These syntaxic and semantic restrictions allow a meaningful use of the notion 
of indiscernables and lead to beautiful stretching theorems connecting the 
existence of some well ordered infinite models of 95 with the existence of some 
finite models generated by indiscernables [FR96J. 

Locally finite languages are defined by second order formulas in the form 
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3R 3f ip where <p is a locally finite sentence and R (respectively, /) represent 
the relation (respectively, function) symbols in the signature of tp. 
These second order quantifications are much more general than the monadic 
ones as the following results show: 

• Each regular language is locally finite, |Res88j . and many context free as 
well as non context free languages are locally finite [FinOlj . 

• Each regular a»-language is a locally finite u;-language, |Fin01] |Fin89] . and 
there exist many more locally finite cj-languages as we shall see below. 

• This result is extended to languages of transfinite length words: if a is an 
ordinal < cu^, each regular a-language is also locally finite [FinOlj . 

But a pumping lemma, following from a stretching theorem, makes locally 
finite tu-languages keep important properties of regular cu-languages, [Res88j 
[FR96J. It is an analogue for each locally finite u;-language of the property: 

" A regular cu-language is non empty if and only if it contains an ultimately 

periodic word " . 

This lemma implies in a similar manner the decidability of the emptiness prob- 
lem for locally finite ^-languages. Moreover for each countable ordinal a < oj u , 
the decidability of the emptiness problem for locally finite a-languages follows 
from similar arguments, [FR96J. 

Other decidability results, as the decidability of the problem: "is a given 
finitary locally finite language infinite?" follow from stretching theorems of 
|Res88|[FR96] . 

These interesting properties of locally finite languages naturally lead to the 
question of the richness of the class of locally finite languages: how large is 
this class? What are its closure properties? 

The study of locally finite languages of finite words was begun by Ressayre in 
|Res88j and continued in [FinOlj . We focus in this paper on the class LOC u of 
locally finite cu-languages and study classical closure properties for this class. 
In particular, we show that LOC^ is neither closed under intersection, nor 
under complementation. The proof uses the notion of rational cone of finitary 
languages which is important in formal language theory and the notion of 
indiscernables in a structure, often used in model theory. 
This gives an answer to a question of Ressayre, [Res89j . Of course we would 
have preferred a positive answer to this question which would have provided 
a useful class of sentences for specification and verification of properties of 
non-terminating systems. But this leaves still open, for further study, the 
possibility to find such a useful class of sentences as a subclass of the class of 
locally finite sentences. 

In section 2, we give the first definitions and some examples of locally finite 
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u;-languages. In section 3, closure properties for cu-languages are investigated. 
We show that the class LOC u is not closed under intersection with regu- 
lar cj-languages thus LOC w is neither closed under intersection, nor under 
complementation (because LOC w is closed under union). Then we prove that 
LOCu is closed under A-free morphism and A-free substitution of locally finite 
(finitary) languages. 



2 Definitions and examples 

2. 1 Definitions 

We briefly indicate now some basic facts about first order logic and model 
theory. See for example [CK73] for more background on this subject. 

We consider here formulas of first order logic. The language of first order 
logic contains (first order) variables x, y, z, ... ranging over elements of a 
structure, logical symbols: the connectives A (and), V (or), — > (implication), 
-i (negation), and the quantifiers V (for all), and 3 (there exists), and also 
the binary predicate symbol of identity =. 

A signature is a set of constant, relation ( different from = ) and function 
symbols, we shall consider here only finite signatures. 

Let Sig be a finite signature. We define firstly the set of terms in the signature 
Sig which is built inductively as follows: 

(1) A variable is a term. 

(2) A constant symbol is a term. 

(3) If F is a m-ary function symbol and ti, • • • , t m are terms, then F(t\, . . . , t m ) 
is a term. 

We then define the set of atomic formulas which are in the form given below: 

(1) If ti and £2 are terms, then t\ = £2 is an atomic formula. 

(2) If ti, t 2 , . . . , t m are terms and R is a m-ary relation symbol, then R(ti, . . . , t m ) 
is an atomic formula. 

Finally the set of formulas is built inductively from atomic formulas as follows: 

(1) An atomic formula is a formula. 

(2) If (p and ip are formulas, then (p Atp, <p\/ tp, (p — *■ ip and -up are formulas. 

(3) If x is a variable and <p is a formula, then \/xtp and 3x(p are formulas. 

An open formula is a formula with no quantifier. 
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We assume the reader to know the notion of free and bound occurrences of a 
variable in a formula. Then a sentence is a formula with no free variable. 
A sentence in prenex normal form is in the form tp = Qxx% . . . Q n x n po(xi, . . . , x n ), 
where each Qi is either the quantifier V or the quantifier 3 and the formula ipo 
is an open formula. 

It is well known that every sentence is equivalent to a sentence written in 
prenex normal form. 

A sentence is said to be universal if it is in prenex normal form and each 
quantifier is the universal quantifier V. 

We then recall the notion of a structure in a signature Sig: A structure is in 
the form: 

M=(\M\,(a M ) a£Sig ) 
Where \M\ is a set called the universe of the structure, and for a G Sig, a M 
is the interpretation of a in M: 

If / is a m-ary function symbol in Sig, then / is a function: M m — > M. 
If R is a m-ary relation symbol in Sig, then R is a relation: R M C M m . 
If a is a constant symbol in Sig, then a M is a distinguished element in M. 

In order to simplify the notations we shall sometimes write a instead of a M 
when the meaning is clear from the context. 

When M is a structure and tp is a sentence in the same signature Sig, we 
write M \= ip for " M is a model of ip " , which means that <p is satisfied in the 
structure M. A detailed exposition of these notions may be found in [CK73] . 

When M is a structure in the signature Sig and Sigi is another signature such 
that Sigi C Sig, then the reduction of M to the signature Sigi is denoted 
M\Sig\. It is a structure in the signature Sigi which has same universe \M\ as 
M , and the same interpretations for symbols in Sigi. Conversely an expansion 
of a structure M in the signature Sigi to a structure in the signature Sig has 
same universe as M and same interpretations for symbols in Sigi. 

When M is a structure in a signature Sig and X C \M\, we define: 

d l (X, M) = X U IJ{/ n-ary function of Sig } f M {X U ) U U{a constant of Sig } flM 

cl n+1 (X, M) = cl l {cl n {X, M), M) for an integer n > 1 
and d(X, M) = U n >i cl n (X, M) is the closure of X in M. 

Let us now define locally finite sentences. We shall denote S(y?) the signature 
of a first order sentence <p, i.e. the set of non logical symbols appearing in tp. 

Definition 2.1 A first order sentence p is locally finite if and only if: 

a) M \= tp and X C |M| imply cl(X, M) |= p 

b) 3n G N such that VM ; if M (= <p and X C |Af|, t/ien cl{X,M) = 
cl n (X,M) (closure in models of ip takes less than n steps). 
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Notation. For a locally finite sentence tp, we shall denote by n v the smallest 
integer n > 1 satisfying b) of the above definition. 

Remark 2.2 Because of a) of Definition \2.1\ a locally finite sentence <p is 
always equivalent to a universal sentence. 

We now introduce some notations for finite or infinite words. 
Let E be a finite alphabet whose elements are called letters. A finite word 
over E is a finite sequence of letters: x = Oq . . . a n where Vz G [0; n] tXj G E. 
We shall denote x(i) = ai the i + 1 th letter of x and x[i] = x(0) . . .x(i) for 
i < n. The length of x is \x\ — n + 1. The empty word will be denoted by A 
and has letter. Its length is 0. The set of finite words over E is denoted E*. 
E + = E* — {A} is the set of non-empty words over E. A (finitary) language 
L over E is a subset of E*. Its complement (in E*) is L~ = E* — L. The 
usual concatenation product of u and v will be denoted by u.v or just uv. 
The set of non negative integers is denoted by N. For K C S*, we denote 
V* = { Vl . . . v n | Vz' G [1; n\ ViGV } U {A}. 

The first infinite ordinal is u. 

An cu-word over E is an w-sequence a$ai . . . a n . . ., where Vz > a 8 6 S. 
For o G S w , cr(n) is the n + 1 th letter of a and a[n) = o~(0)o~(l) . . . a(n). 
The set of u- words over the alphabet E is denoted by E w . An cu-language 
over E is a subset of E w . The a;-power of a finitary language V C E* is the 
cj-language V w = {a = u\ . . . u n . . . G E w | Vz > 1 Ui G V}. 
For a subset A C E w , the complement of A (in E w ) is E w — A denoted A~ . 
The concatenation product is extended to the product of a finite word u and 
an k>word v: the infinite word u.v is then the tu-word satisfying: 
(u.v)(k) = u{k) if k < \u\ , and (u.v)(k) = v(k — \u\) if k > \u\. 

A word over E may be considered as a structure in the usual manner: 

Let E be a finite alphabet. For each letter a G E we denote P a a unary 
predicate and A s the signature {<, (-P a )aes}- The length |er| of a non-empty 
finite word o G E* may be written |er| = {0, 1, . . . , \o~\ — 1}. er is identified to 
the structure (|a|, <°", (P^") ag s) of signature As where 
P° = {i < \cr\ | the i + 1 th letter of a is an a}. 

In a similar manner if cr is an tu-word over the alphabet E, then u is the length 
of the word a and we may write \a\ = u = {0, 1, 2, 3, . . .}. a is identified to 
the structure (u, < a , (P^aes) of signature A s where 
= {z < ijj | the z + l* h letter of a is an a}. 

Definition 2.3 Lei H be a finite alphabet and LCE*. 
T/ien /L z's a locally finite language ] < — ► 

there exists a locally finite sentence if in a signature A D A 2 such that a G L 



6 



iff 3M, M \= Lp and M|A S = a}. 

We then denote L = £ s (<^) and, if there is no ambiguity, L = L(ip) the locally 
finite language defined by tp. 

The class of locally finite languages will be denoted LOC . 

The empty word A has letters. It is represented by the empty structure. 
Recall that if L(cp) is a locally finite language then L((p) — {A} and L(<p) U {A} 
are also locally finite [FinOlj . 

Definition 2.4 Let E be a finite alphabet and LCE U . 

Then [ L is a locally finite u-language } < — ► { there exists a locally finite 
sentence <p in a signature A D such that Vcr G X" a G L iff 3M, M \= <p 
and M|A S = a ]. 

We then denote L = L^(<p), and, if there is no ambiguity, L = L^{(p) the 
locally finite u-language defined by <p. 

The class of locally finite u-languages will be denoted LOC^. 

Remark 2.5 The notion of locally finite (u)-language is very different of the 
usual notion of local (to )-language which represents a subclass of the class of ra- 
tional (ui) -languages. But from now on, as in IFinOlf , things being well defined 
and precised, we shall call simply local (uo) -languages (respectively, local sen- 
tences) the locally finite (u) -languages (respectively, locally finite sentences). 

2.2 Examples of local uj -languages 

The following example should not be skipped because it is crucial to Theorem 
EH below. 

Example 2.6 The u-language which contains only the word a = abab 2 ab 3 ab 4 . . . 
(where the i-th occurrence of a is followed by the factor b l a) is a local uj- 
language over the alphabet {a, b}. 

Proof. Let the signature S(ip) = {P a , Pb, <,p,p', /}, where p and p' are unary 
function symbols, / is a binary function symbol. And let if be the following 
sentence, conjunction of: 

(1) "ixyz[(x < yVy < x)A(x < yAy < x «-> x = y)A(x < yAy < z —>■ x < z)], 

(2) Vx[P a (x) <-> -P 6 (x)], 

(3) Vxy[(x < y A P a (x) A P a (y)) - P b (f(xy))}, 

(4) Wxy[x > y — » f(xy) = x), 

(5) Wxy[^P a (x) V -.P (y) ^ f(xy) = x], 

(6) Vx[P a (p(x))AP a (p'(x))\, 

(7) Vx[^P a (x) —> p(x) <p'(x)}, 

(8) Vx[P a (x) — > p(x) = p'(x) = x\, 
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(9) Vx[^P a (x)^x = f(p(x)p>(x))}, 

(10) Vxx'y G P a [x < x' < y -> /(afy) < /(^) < y], 

(11) Vrryy' G P [a; <y' <y -> y' < /(xy) < j/]. 

Above sentence (1) means that "< is a linear order ". The sentence (2) ex- 
presses that P a , Pb form a partition in any model M of </?. The sentence (3) 
says that / is a function from {(x,y) \ x < y A P a (^) A P a (y)} into p, while 
sentences (4)- (5) express that the function / is trivially defined elsewhere. 
Sentences (6)-(7) say that p and p' are functions defined from -P a = P b into 
P a and (8) states that p and p' are trivially defined on P a . 
The projections p and p' are used to say that the function / is surjective from 
{(x,y) | x < y A P a (^) A P a (y)} onto p,; this is implied by sentence (9). 
The 10 th and 11 th conjunctions are used to order the elements of f(P a x P a ) 
in order to obtain the word a = abab 2 ab 3 ab 4 . . . when the reduction to the 
signature of words is considered. 

Notice that (10)-(11) imply also that / is injective hence is in fact a bijection 
from {(x, y) \ x < y A P a (x) A P a (y)} into p,. □ 

Remark 2.7 We have defined the functions f and p, p' , in a trivial manner 
(like f(xy) = x or p(x) = x) where they were not useful for defining the local 
uj-language {c}, (see the conjunctions (4), (5) and (8)). This will imply that 
closure in models of tp takes at most a finite number of steps. This method 
will be applied in the construction of other local sentences in the sequel of this 
paper. 

We can easily check that tp is equivalent to a universal formula and that closure 
in its models takes at most n v = 2 steps: one takes closure under the functions 
p,p' then by /. 

Hence tp is a local sentence and by construction: L^ b ^(tp) = {abab 2 ab 3 . . .}. 

Let us give some examples of closure in a model M of tp such that M|A{ a fe } = 
a = abab 2 ab 3 ab 4 . . . 

Let X n C \M\ be the segment of M corresponding to the segment ab n a of 
a. Then the closure of X n under the functions p, p' is the set X n U Z n where 
Z n corresponds to the set of the (n — 1) first letters a of a. cl(X n) M) is the 
closure of X n U Z n under / and it is the initial segment of M corresponding 
to the initial segment abab 2 ab 3 ab 4 . . . ab n a of a. 

Let now Y C \M\ be the segment of M corresponding to the three last letters 
b 2 a of the segment ab n a of a, for some integer n > 3. Then the closure of Y 
under the functions p, p' is Y U Z where Z corresponds to the set of the two 
first letters a of a. The closure of Y U Z under / is the set cl(Y,M) which 
induces the word abab 2 a but which is not a segment of M because it contains 
the two first letters a and the (n + l)-th letter a of a but not any other letter 
a of a. 
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We are going now to get more examples of local cu-languages. Recall first the 
following: 

Definition 2.8 The uo-Kleene closure of a family C of finitary languages is: 
u-KC(C) = {U^itW I V* G [1, n] Ui, Vi G £} 



This notion of cu-Kleene closure appears in the characterization of the class 
REG^ of regular cu-languages (respectively, of the class CF U of context free uj- 
languages) which is the cu-Kleene closure of the family REG of regular finitary 
languages (respectively, of the family CF of context free finitary languages), 
|Tho90j [PP02] [Sta97j . 

A natural question arises: does a similar characterization hold for local lan- 
guages? The answer is given by the following: 

Theorem 2.9 The u-Kleene closure of the class LOC of finitary local lan- 
guages is strictly included into the class LOC w of local uj-languages. 

Proof. We have already proved that u-KC(LOC) C LOC,, in jFinOl] . In 
order to show that the inclusion is strict, remark that if an tu-language L 
belongs to u-KC (LOC) , then L contains at least an ultimately periodic word, 
i.e. a word in the form u.v w where u and v are finite words. Now we can easily 
check that the local ^-language given in example 12.61 does not contain any 
ultimately periodic word because its single word is not ultimately periodic. □ 

A first consequence of Theoreme 12.91 is that every regular tu-language is a 
local cu-language, i.e. REG^ C LOC^, because every finitary regular language 
is local [Res88j . 

We had shown in |Fin01j that many context free languages are local thus 
CF^ = uj-KC(CF) implies that many context free cu-languages are local. 
The problem to know whether each context free language is local is still open 
but by Theorem [2H CF C LOC would imply that CF W C LOC w . 

The tu-language given in example 12.61 is local but non context free because 
every context free a;-language contains at least one ultimately periodic word. 
We proved in [FinOl] that the finitary language U = {a n b n | n > 1} C {a, b}* 
is local. Thus the ^-language U.c^ C {a, b, c} u is local but U.& is not context 
free because U i CF, [CG77] . 

These two examples show that the inclusion LOC u C CF^ does not hold. 
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3 Closure properties of locally finite omega languages 



Theorem 3.1 The class of locally finite omega languages is not closed under 
intersection with a regular co-language in the form L.a w where L is a rational 
language, ICS* and a ^ E. Hence LOC w is neither closed under intersection 
nor under complementation. 

To prove this theorem, we shall proceed by successive lemmas. We shall as- 
sume that every language considered here is constituted of words over a finite 
alphabet included in a given countable set E/> 

We firstly define the family / of finitary languages by: for a finitary language 
L C E*, where E C E^ is a finite alphabet, L G / if and only if L.a^ G LOC^ 
whenever a is a letter of E^ — E. 

It is easy to see that if L.a^ is a local w-language and a G E D — E, then for 
all b G So — E, it holds that L.b u G LOC w . It suffices to replace the predicate 
P a by Pb in the sentence defining L.a u . 

Lemma 3.2 I is closed under inverse alphabetic morphism. 

Proof. Let Lei, i.e. ICP for some finite alphabet T, a T and L.a^ = 
L^ u ^(p) for a local sentence ip. 

Let h be the alphabetic morphism: E* — > T* , defined by h(c) G T U {A} for 
c G E, where A is the empty word. And let E' = {c G E | h(c) = A}. We 
assume, possibly changing a, that a E. 
We first replace in p the letter predicates (P c ) ce r by (Q c ) c er- 

The language /i^ 1 (L).a w is then defined by the following sentence vp, in the 
signature S(ip) = {P, A, (P c ) ce s, Pi} U S(tp), where S(<p) contains the letter 
predicates Q c for c G T U {a}, P is a unary predicate symbol and A is a 
constant symbol. ^ is the conjunction of: 

• ( < is a linear order ), 

• ((P c ) c6S ,P a ) form a partition, 

• Vxi...x n G P[</?o(zi...£n) A A cer (Qc(a:i) <-> Vdeh-i(c) p d{xi)) A (Qafai) ^ 
P a (a;i))], where ip = \/xi...x n p>Q(xi...x n ) with <p an open formula, 

• \/xi...Xk[(SI \<j<k^P( x j)) ~^ g(xi...Xk) = Xi] , for each k-ary function g of 

S{p), 

• (P c ) c gs' form a partition of -P, 

• P(B), for each constant B of S(tp), 

• VxyhP a (y) AP a (i)-»i/<i], 

• P«(A). 



10 



ip is equivalent to a universal sentence and closure in its models takes at most 
+ 1 steps. Hence ip is local and by construction L^ u ^(ip) = h~ l (L).a^ .□ 

Lemma 3.3 I is closed under non erasing alphabetic morphism. 

Recall that an alphabetic morphism h : X* —>■ T*, defined by h(c) G T U {A}, 
for c 6 E is said to be non erasing if Vc G S, h(c) G T. 

Proof. Let L G I,L C £*. Let a £ £ and L.a w = L^ u ^(^) for a local 
sentence <£>. Let /i be a non erasing alphabetic morphism given by h : £ — ► T. 
Moreover we assume that a ^ T (possibly changing a). Then the language 
h(L).a w is defined by the following formula ip, in the signature S(ip) = S(ip) U 
{(<5c)cer}- The sentence ^ is the conjunction of: 

• vx[A ceE (p c (x) -> g Mc) (x))], 

• [(Qc)cer, {Pa)] form a partition. 



ip is local and if the predicates (Q c )cer, P a , are the letter predicates, ip defines 
the w-language {ip) = h{L) .a" . □ 

Lemma 3.4 / contains the finitary local languages. 

Proof. LOC w contains the omega Kleene closure of the class LOC of finitary 
local languages, and for each letter a the language {a} is local. □ 

Recall now the definitions of the Antidyck language and of a rational cone of 
languages. 

Definition 3.5 The Antidyck language over two sorts of parentheses is the 
language Q% = {v G (Y U Y)* \ v ^* A} ; where Y = {y l ^y 2 } ! Y = {yi,y 2 } 
and — >* is the transitive closure of — > defined in (Y U Y)* by: 
Vy G Y yv\yv2 — > v\Vi if and only if v% G V*. 

The Antidyck language Q% may be seen as the language containing words 
with two sorts of parentheses, such that: "the first parenthesis to be opened 
is the first to be closed" 

Definition 3.6 ( | Ber79] ) A rational cone is a class of languages which is 
closed under morphism, inverse morphism, and intersection with a rational 
language. ( Or, equivalently to these three properties, closed under rational 
transduction). 

The notion of rational cone has been much studied. In particular the Antidyck 
language Q 2 * is a generator of the rational cone of the recursively enumerable 
languages, [FZV80] . 
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On the other hand Nivat's Theorem states that a class of languages which is 
closed under alphabetic morphism, inverse alphabetic morphism, and inter- 
section with a rational language, is a rational cone, |Ber79j . Moreover every 
rational transduction t is in the form t(u) = gfo -1 ^) D R], where g et h are 
alphabetic morphisms and R is a rational language. Thus every recursively 
enumerable language may be written in the form g[h~ l (Q'2) PI R], where R is 
a rational language, g and h are alphabetic morphisms. 
This result will be used here because the language Q'£ is local, |Fin01j . 

Return now to the proof of Theorem 13.11 and suppose that LOC^ were closed 
under intersection with the languages R.a u , where R C E* is a rational lan- 
guage and a ^ E. 

Claim 3.7 I would be closed under intersection with a rational language. 

Proof. Let L G I, L C E*, R C E* be a rational language and a ^ E. L.a^ is 
a local cu-language because L G / and (L.a aJ ) R (R.a^) — (L fl R).^ would 
be a local tu-language, hence by definition of I, Lf} R would belong to /. □ 

Claim 3.8 / would contain every language in the form g[h~ 1 (Q , 2)r\R], where 
h is an alphabetic morphism, g is a non erasing alphabetic morphism and R 
is a rational language. 

Proof. It follows from the lemmas 13.21 13.31 13.41 the fact that Q' 2 * is local and 
Claim [321 □ 

Claim 3.9 There would exist an erasing alphabetic morphism h and L G / 
such that {0 n l p | p > 2™} = h(L), where an alphabetic morphism is said to 
be erasing if it is in the form h : S — > S U {A}, with h(c) = c if c G A and 
h(c) = A if c G X — A, for some subset ACE. 

Proof. We know that every recursively enumerable language may be written 
in the form 5 , [/i _1 (Q / 2 *) H R], where R is a rational language, g and h are 
alphabetic morphisms. 

But every alphabetic morphism may be obtained as composed firstly by a non 
erasing alphabetic morphism followed by an erasing alphabetic morphism. 
Thus it follows from Claim I3TB1 that each recursively enumerable language, and 
in particular the language {0 n l p | p > 2™}, would be the image by an erasing 
alphabetic morphism of a language of /. □ 

Let then h be an erasing morphism E* — > E* where {0, 1} C E, h(0) = and 
h(l) = 1 and h(c) = A if c G E — {0, 1} and let L C E* be a language such 
that h(L) = {0 n F | p > 2 n }. 

Assume that L belongs to /, so if a ^ E, L.a u is a local cu-language and there 
exists a local sentence ip such that L.a^ = L^ u ^ (</>). 

For all n > 1 let M n be a model of ip of order type u such that M n \ A SU { a } = 
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a n .a? ', where <r n G L and the number of occurrences of in <r n is n and the 

number of occurrences of 1 in a n is p n > 2" . 

Let us now set the following definition in view of next lemma. 

Definition 3.10 Let X be a set included in a structure M and P C \M\. X is 
a set of indiscernables above P for the atomic formulas of complexity < k, i.e. 
whose terms result by at most k applications of function symbols, for k e N, 
if and only if: 

i) X is linearly ordered by <. 

ii) Whenever x and y are some n-tuples of elements of X which are isomor- 
phic for the order of (X, <), x and y satisfy the same atomic formulas of 
complexity < k, with parameters in P. 

Lemma 3.11 In the above conditions where M n is defined for every integer 
n > 1, there exists in M n an infinite set X n of indiscernables above P M " for 
the atomic formulas of complexity < n^, with X n C P* /n 

Proof. Let m(ip) be the maximum number of variables of the atomic formulas 
of complexity < n v i.e. whose terms result by at most n v applications of 
function symbols. These terms form a finite set . 

For all strictly increasing sequences x and y of length m((p) of P^ In , let us 
set x ~ y if and only if x and y satisfy in M n the same atomic formulas with 
parameters in Pq 1 " and of complexity < n v . 

P^ 1 " is a finite set of cardinal n, hence the set of atomic formulas with param- 
eters in P Mn and of complexity < n v is also finite. 

Then applying the infinite Ramsey Theorem, we can find X n C P^ In homoge- 
neous for ~ and infinite. This is the set we are looking for. □ 

We return now to the proof of Theorem 13.11 and consider in | M n | the subset 
X n U Pq 1 " = Y n . This subset is infinite hence it is of order type uo in M n and 
it generates in M n a model of order type uo too, which will be denoted by 
M n (Y n ) = A n . 

This model of (p induces a word u„.a w of L.a^, such that there are in u n : 
n occurrences of the letter and q n < p n occurrences of the letter 1. But 
u n .a? e L.a^ implies that 2 n < q n also holds. 

A n is generated from Y n by the use of only a finite set of terms of less 
than k^ variables. If n is big enough with regard to k v and card(T ¥> ), be- 
cause q n > 2™, there exist parameters a±, . . . , a^, elements of P n , and some 
indiscernables xi,...,Xj, and y±, . . . , yj of X n , such that x\ < . . . < Xj and 
yi < . . . < yj and x ^ y and a term t G T v such that t(ai, . . . , a^, Xi, . . . , Xj) < 



13 



t(a\, . . . , a*;, 2/1, . . . , yj) and these two elements being in P 1 ". 

But then we could find in X n a sequence {xi)i<i<N, with iV arbitrarily large, 
such that for each i, 1 < % < N, is of the order type of xy. 

Then for all integers i such that 1 < i < N, P^ n {t(a\, . . . ,ak,Xi)) and the 
terms £(ai, . . . , a^, X{) are distinct two by two. This would imply that, for all 
integers N > 1, card(P 1 M ") > N. So there would be a contradiction with 
card(P ] Afn ) = p n and we have proved that L does not belong to /. 

Thus we can infer Theorem 13 . II from Claim [331 The non closure under comple- 
mentation of the class LOC^ can be deduced from the non closure under inter- 
section and the fact that LOC^ is closed under union (see next Theorem) or 
from an example, deduced from preceding proof, of a local u-language which 
complement is not local (see next remark). □ 

Remark 3.12 The above proof shows in particular that the o;-language A = 
{0 n l p 2 UJ | p > 2™} is not local. From which we can easily deduce that the local 
cu-language {0 n l p 2 UJ | p < 2 n } = L has a complement which is not a local 
a»-language. (This cu-language L is local because {0 n P | p < 2 n } is a local 
finitary language |Fin01j ). 

Indeed if its complement was cj-local, we would deduce, from a local sentence 
ip such that L w {ip) = L~, another local sentence ip such that L u (ip) = A. 
For example the sentence if), conjunction of: 

• ¥ , 

• Vxy[(P (x) A Pi(y)) -> x < y] , 

• Vzy[(Pi(z) A P 2 {y)) -> x < y\ , 

• Vxy[(P (x) A P 2 (y)) -> x < y\ , 

• Pi{c), where c is a new constant symbol. 

Now we establish that LOC^ is closed under several operations. 

Theorem 3.13 The class LOC^ is closed under union, left concatenation 
with local (finitary) languages, X-free substitution of local (finitary) languages, 
X-free morphism. 

Proof. 

Closure under union. 

Let cpi and <p 2 be two local sentences defining local tu-languages L w (y?i) and 
Loj(<f2) over a finite alphabet E. Let us define a new local sentence <pi U ip 2 
which defines the local (^-language L w (ipi U (p 2 ) = L u (<fi) U L L0 {^p 2 ): 

We may assume that S(ipi) fl S(ip 2 ) = As. Then S(ipi U <p 2 ) will be S(ipi) U 
S(ip 2 ). And the sentence <pi U <p 2 is the following sentence: 
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[Vl An— ary function symbol f&S(ip2)(^ X li ■ ■ ■ i x nf{ x lj • • • 5 x n) TTlin{x\ } . . . , X n ))] 
V[V 9 2 An-ary function symbol geS((p 2 ) O^El, • • • ) x n,g( x l, ■ ■ ■ , x n) = min(xi, . . . , X n ))\ 



Closure under left concatenation by a local finitary language. 

Consider a finitary local language L(<p) and a local ^-language L u (tl)) over 
the same alphabet T. 

We may easily assume that L^ip) is empty, possibly adding a constant symbol 

c to the signature of ip and adding the conjunction Vx[x < c] to the sentence 

if (this means that every model of ip has a greatest element). 

We may also assume that S((p) PI S^VO = {<, (P a ) ag r} = Ar- 

Let then P be a new unary predicate symbol not in S(<p) U S(ip), and let tp.ip 

be the following sentence in the signature S(f>) U S(if)) U {P}, which is the 

conjunction of: 

• (< is a linear order ), 

• ((-fa)aer form a partition), 

• Wxy[P(x) A ~>P(y) —* x <y\, 

• Vaci, . . . ,Xj G P[(f {x u . . . , Xj)), 

where 90 = Vxi, . . . , Xjipo(xi, . . . , Xj) with <^ an open formula, 

• Vxi, . . . ,x m e P[f(xi, . . .,x m ) e P], 

for each m-ary function / of S((p), 

• Vxi, . . . , i m [Vi<i< m ~~*P{, x i) > fipii ■ ■ ■ > x m) = rn,in{x\, . . . , x m )], 
for each m-ary function / of S((p), 

• P(c), for each constant c of S(ip), 

• Vsci, . . . ,Xj G -P[V>o(^i, • • • , Zj)], 

where ?/> = Vxi, . . . , Xjipo(xi, . . . , Xj) with ^ an open formula, 

• Vzi, . . . , x m G -iP[/(aci, . . . ,x m ) G -iP], 
for each m-ary function / of S(ijj), 

• Vxi, . . . , a; m [Vi<!<m P\ x i) * f\ x i-> • • • 1 x m) = niin(xi, . . . , x m )], 
for each m-ary function / of S(i/j), 

• -P(c), for each constant c of S(ip), 

This sentence is equivalent to a universal formula and closure in its models 
takes at most max{n v , n$) steps, hence it is a local sentence and by construc- 
tion it holds that L(ip.ip) = L((p).L(ip). Moreover when cu-words are considered 
Luiip.ip) = L((p).L w (i(}) holds because by hypothesis L^ip) is empty. 



Closure under A-free substitution of local languages. 

The proof is very similar to our proof of the closure of the class LOC under 
substitution by local finitary languages in [FinOlj . We recall it now. 
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Recall first the notion of substitution: 

A substitution / is defined by a mapping E — > P(T*), where E = {a 1; a n } 
and T are two finite alphabets, / : — > Li where Vz G [l',n], Li is a finitary 
language over the alphabet T. The substitution is said to be A-free if Vz G [1; n], 
Lj does not contain the empty word A. It is a (A-free) morphism when every 
language Lj contains only one (nonempty) word. 

Now this mapping is extended in the usual manner to finite words and to 
finitary languages: for some letters x(0), . . . , x(n) in E, /(x(O)x(l) . . . x{n)) = 
{u Q Ui . . .u n | Vz G [0; n) G f(x(i))}, and for L C £*, /(L) = U xe i,/(a;). 
If the substitution / is A-free, we can extend this to cu-words and o;-languages: 
f(x(0)x(l) . . . x(n) . . .) = {uqU\ . . .u n . . . | Vz > Ui G f(x(i))} and for 
LCV>,f{L)=U xeL f{x). 

Let then E = {a%, . . . , a n } be a finite alphabet and let / be a A-free substitution 
of local languages: E — > P(T*), a { — > Li where Vz G [l',n], Li is a local 
language defined by the sentence (fi, over the alphabet T. We may assume 
that L w (ipi) is empty, possibly adding a constant symbol q to the signature of 
ifi and adding the conjunction Vx[x < cj to the sentence <£>j (this means that 
every model of (fi has a greatest element). We also assume that the signatures 
of the sentences ipi verify S(<pi) D S((pj) = {<,(P a ) ae r} for i ^ j. Let now 
L C E w be a local tu-language defined by a local sentence if. We shall denote 
the unary predicate of S(<p) which indicates the places of the letters a% in a 
word of L, so if a,i G THE for some indice z, there will be two distinct predicates 
Q ai and P ar We may also assume that Vz G [1, . . . , n], S^V 9 *) fl S(ip) = {<}■ 
Then we can construct a local sentence ip (already given in |Fin01j ) such that 
/-•('••) /(/-)• 

ip is the conjunction of the following sentences, which meaning is explained 
below: 

• " < is a linear order " , 

• Vxy[(I(y) <y)A(y<x^ I(y) < I(x)) A (I(y) <x<y^ I(x) = I(y))}, 

• Vx[/(x) = x <-> P(x)], 

• P(c), for each constant c of S(ip), 

• Vxi, . . . , Xjfc[-R(xi, . . . , x fc ) -> P(xi) A ... A P{x k )], 
for each predicate R(xi, . . . , x^) of £(</?), 

• Vxi, . . . , x j [{P{x 1 ) A ... A P{x 3 )) -> P(/(x!, . . . , x,-))], 
for each j-ary function symbol f of S(<p), 

• Vxi, . . . ,Xj[(Vi<i<j /(^i' • • • = m ™{x x , . . . ,xj)], 
for each j-ary function symbol f of S(ip), 

• Vxi, . . . , x m [(P(xi) A ... A P{x m )) -> ^o(^i, • ■ • , SCm)]> 

where = Vxi, . . . , x m <£> (xi, • • • , x m ) with yj an open formula, 

• Vxi, . . . ,x j [Vi i fc<j(/(x i ) ^ I{x k )) -> /(an, . . . ,Xj) = min(xx, . . .,x,)], 
for every function f of S^z) for an integer I < n, 

• Vxyi,---,Vj[(Ai<i<ji(yi) = H x )) -» Hfivi,- ■■,%•)) = / 0)]> 

for each j-ary function symbol f of S((pi) for an integer z < n, 
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Finally, for each i < n: 
• Vxy 1 ,...,y p [(h 1 < l < p I(yi) = I(x) A Q ai (I(x))) -> . . . , y p ) A {(e^yi) = 

J (» A • • • , y P ) = yi A ^Rj(yi, . . . ,y p ) A A^es^) e i(2/i) = e*(aO; wh ere 

n > j ^ i, and ej,fj,Rj run over the constants, functions, and predicates 
ofSfo-)}], 

Above , 1) to each constant e/ of S((pi) is associated a new unary function 
ej(y) and 2) whenever = Vyi, . . . , y p ipi(yi, . . . , y p ) with ^ an open formula, 
(pi is ipi in which every constant has been replaced by the function ej(y). 

Construction of ^ : 

Using the function / which marks the first letters of the subwords, we divide 
an a>word into omega (finite) subwords (the function / is constant on each 
subword and I{x) is the first letter of the subword containing x). In every 
model M of order type u of ip, the set of the "first letters of subwords" , 
P M , grows richer in a model of order type to of tp (therefore will constitute an 
cj-word of L). 

Then we "substitute": for each letter a, in P M , we substitute a (finite) word 
of Lj, using for that the formula </?j. 

If closure takes at most n v (respectively n v J steps in every model of tp (re- 
spectively of ifii), then closure takes at most [1 + + supi(n Vi )] steps in each 
model of tp (one takes closure under the function /, then under the functions 
of S(ip), and finally under the functions of S(ipi), 1 < i < n). 
Therefore tp is a local sentence and by construction tp defines the cu-language 
f(L). 



Closure under A-free morphism. 

It is just a particular case of the preceding one, when every language Lj con- 
tains only one non-empty finite word. □ 



Acknowledgments. Thanks to the anonymous referees for useful comments 
on the preliminary version of this paper. 
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